Getting Back Up: Understanding How Enterprise Data Backups Fail
نویسندگان
چکیده
In the enterprise world, retaining data backups is the de-facto solution against data loss in the event of catastrophic failures. As backup software evolves to achieve faster backup and recovery times, however, backup systems deploying it become increasingly complex to administer. This complexity stems from optimizations targeted to specific applications, which increase the number of configuration parameters for the system. Still, there is no work in the literature that attempts to study the error characteristics of enterprise backup systems, despite our reliance on the guarantees they provide. With this study we aim to help researchers and practitioners understand how backup system jobs fail, and identify factors that can be used to predict these failures. Our results are derived from an analysis of data on 775 million jobs, collected from more than 20,000 backup software installations over a span of 3 years. We confirm that trends reported in the software reliability literature also hold for backup systems, such as that the majority of job errors are due to misconfigurations. For the systems in our dataset, we find that error rates remain stable across software versions and over time. To better understand these errors, we investigate the effect of several factors on the system’s error rate, such as job sizes and policy complexity, and demonstrate their predictive power for future errors.
منابع مشابه
Evaluating the Evaluator: Towards understanding Feed-back, Feed-up, and Feed-forward of Moroccan Doctorate Supervisors’ Reports
Supervisor’s feedback is both a naysaying and a puzzling concern that has always tormented academics in higher education. Particularly, written feedback on pre-final or final versions of a submitted doctoral dissertation is indisputably the most significant step toward granting a doctoral student supervisee the right to defend his/her research project. It also constitutes a rich source on how s...
متن کاملeCryptfs: An Enterprise-class Encrypted Filesystem for Linux
eCryptfs is a cryptographic filesystem for Linux that stacks on top of existing filesystems. It provides functionality similar to that of GnuPG, except the process of encrypting and decrypting the data is done transparently from the perspective of the application. eCryptfs leverages the recently introduced Linux kernel keyring service, the kernel cryptographic API, the Linux Pluggable Authentic...
متن کاملPerformance of a Parallel Network Backup Manager
The advent of inexpensive multi-gigabyte tape drives has made possible the completely automated backup of many dozens of networked workstations to a single tape. One pioblem that arises with this scheme is that many computers cannot backup theiì disks óver the network at more than a f¡action of the tape's rated speed. Thus, running overnight backups sequentially can take well into the next day....
متن کاملEffect of Remote Back-Up Protection System Failure on the Optimum Routine Test Time Interval of Power System Protection
Appropriate operation of protection system is one of the effective factors to have a desirable reliability in power systems, which vitally needs routine test of protection system. Precise determination of optimum routine test time interval (ORTTI) plays a vital role in predicting the maintenance costs of protection system. In the most previous studies, ORTTI has been determined while remote bac...
متن کاملTool Support and Data Management for Business Analytics Applications in Healthcare
The data delivery architectures in most enterprises are complex and under documented. When a healthcare manager looks at a report, they want to know exactly what each element or technical expression on a report means, where the values shown originate from and how often they are getting updated. We propose a tool framework that includes: a metadata repository for enterprise data consolidated in ...
متن کامل